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Abstract 

Given a sequence of n numbers, the Maximum Consecutive Subsums Problem(MCSP) 
asks for the maximum consecutive sum of lengths i for each i = 1,... ,n. No algorithm 
is known for this problem which is significantly better than the naive quadratic solution. 
Nor a super linear lower bound is known. The best known bound for the MCSP is based 
on the the computation of the (min, +)-convolution, another problem for which neither 
an 0(n^~’^) upper bound is known nor a super linear lower bound. We show that the two 
problems are in fact computationally equivalent by providing linear reductions between 
them. Then, we concentrate on the problem of finding super linear lower bounds and pro¬ 
vide empirical evidence for an r2(nlogn) lower bounds for both problems in the decision 
tree model. 


1 Introduction 

Given a sequence A = (oi, 02 , ... ,an) of n numbers, the MCSP asks to compute the sequence 
mi,..., rUn, where m^ = maxj=i^...„_^_|_i{ai-|-- • is the maximum over all consecutive 

subsums of length L 

The MCSP appears in several scenarios of both theoretical and practical interest like 
approximate pattern matching [^, mass spectrometry data analysis and in the problem 
of locating large empty regions in data sets [3]. Most work has been done for the case where 
the input sequence is binary, since in this case the MCSP coincides with the problem of 
constructing membership query indexes for jumbled pattern matching lamiii!- 

It is not difficult to come up with simple O(n^) solutions for the MCSP since each value 
mi can be easily computed in linear time by one pass over the input sequence. Surprisingly, 
despite the growing interest generated by this problem (see, e.g., [sisiiiniiiB], and references 
therein quoted), no solution is known with running time 0(n‘^~^) for some constant e > 0, nor 
a lower bound better than the trivial U(n) is known. 

Algorithms that produce approximate solutions for the MCSP are also known [TO]. How¬ 
ever, for the general case, the best available algorithm in the real RAM model runs in 0{v? j log n) 
mm and makes use of the algorithm proposed in [4] for computing a (min, -|-)-convolution. 
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The (min, +)-convolution problem is a natural variation of the classical convolution prob¬ 
lem; Given two sequences X = (xq, xi, ..., x„) and Y = {yo, yi,..., yn) of real numbers, the 
{min, +)-convolution of X and Y is the sequence z = zq, zi... Z 2 n, with z^ = min ^ {x* + 
yk-i}, for fe = 0,... ,2re. 

This problem has important applications in a variety of areas, including signal processing, 
pattern recognition, computer vision, and mathematical programming. According to [1], this 
problem has appeared frequently in the literature since Bellman’s early work on dynamic 
programming. 

Like for the MCSP, no strongly subquadratic algorithm appears to be known to compute 
the {min, +)-convolution. The best known algorithm for computing {min, +)-convolution runs 
in 0(n^/log n). Recently, Williams [19] has provided a Monte Carlo algorithm that computes 
the (min,+) convolution in time on the real RAM. Although, this bound is 

better than 0{n^/polylog{n)) it is still Lo{n‘^~'') for any e > 0. For the special case of monotone 
increasing sequences with elements bounded by 0{n) a recent breakthrough in [7] shows that 
{min, +)-convolution can be computed in time 0{n^'^^'^). This result implies an equivalent sub 
quadratic bound for MCSP on 0/1 sequences. 

Taking into account the apparent difficulty to devise an 0{n^~^) algorithm for the MCSP 
and for computing the {min, +)-convolution, a natural question to ask is whether there exists 
a non-trivial superlinar lower bound for these problems. 

The decision tree model is a widely used model of computation to study lower bounds for 
algorithmic problems. Fundamental algorithmic problems as searching, sorting and selection 
are examples of problems that have been studied in this model. In the decision tree model 
each algorithm can be represented by a decision tree, where internal nodes correspond to 
computations; the branches (edges) that leave an internal node v correspond to the possible 
results of the computation associated with v and the leaves correspond to the possible outputs 
of the algorithm (see section [3] for the exact definitions). Some variants/specializations of the 
decision tree model as linear decision trees and algebraic decision trees have also been used 
to study the complexity of algorithmic problems [2]. In |1] it is shown that the (mm,+)- 
convolution of two n-element vectors can be computed in 0(n^/^) in the non-uniform linear 
decision tree model. 

This work describes our findings in the quest for a super-linear lower bound for the MCSP 
in the decision tree model of computation. 

Our Contributions. We provide computational evidences that the running time of both 
MCSP and (min,-|-)-convolution problem is n(nlogn) in the decision tree model of computa¬ 
tion. 

We first establish a linear equivalence between both problems by showing a linear reduction 
from (min,-|-)-convolution to MCSPThe opposite direction was shown in |3|. As a result of 
this equivalence, any bound for one problem also holds for the other. Then, in the following, 
we only concentrate on the MCSP . 

In Section [S] we argue that a lower bound for the MCSP in the decision tree model can 
be obtained by generating a large set of inputs sequences such that no pair of them produces 
a common output. We present a construction which shows that there exists such a set of 
exponential size, although, this is still not enough to prove a superlinear lower bound on 
MCSP: 

In Section 0] by using a deterministic approach we show empirically that for n < 14 
there exists a set of inputs, with the above property, and cardinality larger than (n/2)! so 
that n/21og(n/2) is a lower bound on the depth of any decision tree that solves the MCSP 


2 


for instances of size n < 14. This required 27 hours of CPU time in our computational 
environment. In order to address larger values of n, we employed sampling strategies. We 
devised a hypothesis test and showed that the n/21og(n/2) lower bound also holds for any 
n < 100 with confidence much larger than 99.999%. 

Our results can bring new insight on the complexity of both the MCSP and the (min,+)- 
convolution problem and may represent a initial step towards proving a superlinear lower bound 
for these problems. Moreover, the techniques employed might be useful in the investigation of 
lower bounds for other computational problems with the same flavour. 

2 An Equivalence between MCSP and the (mm, +)-Convolution 
Problem 

We start by showing that MCSP and {min, +)-convolution are computationally equivalent. 

Theorem 1. There exist linear reductions between the MCSP and the {min, +)-convolution. 

Proof. The reduction from MCSP to {min, +)-convolution is observed in [3] where the latter 
problem is presented under the geometric naming Lowest Midpoints Problem. Thus, we just 
present the reduction from the {min, +)-convolution to MCSP. 

Let I = {X, Y) be an input for the {min, +)-convolution, where X = (xq, ..., x„) and Y = 
{uo, ■ ■ ■, Un)- Let 5 be a large enough number and define the input sequence A= (ai,..., 02^+4) 
for the MCSP as follows; a^+i = an+4 = S] an+3 = —yo', an+2 = —xq] for each z < n + 1 set 
Ui = Xn-i - Xn+i-i and for z > n + 4 set a* = yi-n-b - Vi-n-A '■ 


ai 

dn 

^n+1 

Un+2 

an+3 

^n+4 

^n+5 


U2n+4 

(^Xn—l Xn) * ‘ 

{xo-Xi) 

s 

-Xo 

-yo 

s 

0 

1 

{yi -y 2 ) ■■ 

{Vn-1 - Vn) 


Assuming S > Y17=o 1^*1 % 1^*11’ ^^7 fc > 4 a maximum sum of k consecutive elements 

includes On+i and On+A- Then, for each k > A, 


j+k—l n+A+s 

max > Qi = max > a* = max 2S — Xf — ys' = 

j=l,...,{2n+A)—k+l ^t 0<s,t<k—A . ^^ 0<s',t'<k—A 

s+t=k—A ^=n+l—t 

25- min {xf + yk-A-t') =‘^S - Zk-A- 

0 <t'<fc -4 

where, as above, zo,...,Z 2 n denotes the result of the convolution of X and Y. Therefore, 
for each /c = 0,..., 2n, we have the equivalence Zk = 2S — > where pk is the 

starting position of a maximum consecutive sum of length k in A. □ 

Due to these linear time reductions we can conclude that MCSP and {min, +)-convolution 
have the same time complexity. In the following, we will focus our discussion on lower bounds 
for MCSP and any conclusion reached will also hold for the {min, +)-convolution. 

3 Towards a Lower Bound for the MCSP 

In this section we discuss our approach to prove a lower bound for the MCSP. It will be 
convenient to employ the following alternative formulation of the MCSP 

Input. A sequence A = (oi,..., o„) of n real numbers; 
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Output. A sequence P = (pi,... ,Pn)-, where pi, for £ = 1,..., n, is the starting position of 
a consecutive subsequence of A that has maximum sum among the consecutive subsequences 
of A with length I, i.e., the subsequence Up^+i • • • ap.+£_i is a maximum consecutive subsum 
of length 

We call the sequence P an output configuration or simply a configuration. 

For example, for the input sequence A = (3,0,5,0,2,4) the only output configuration is 
given by the sequence P = (5, 5,1, 3, 2,1), which says, e.g., that there is a maximum consecutive 
subsum of length 4 starting at position 3. 

We note that there are n! possible configurations because pi, for z = 1,..., n, can assume 
any value in the set {1,... , n — z + 1}. In particular for the input A = (oi,..., a„,), with 
ai = 02 = • • • = On = 1, all the n! possible configurations are output configurations for A. 
This example also shows that some input sequences have more than one output configuration. 

We will study the above version of MCSP in the decision tree model. An algorithm in 
this model is a ternary tree. Each internal node I contains a test /(/) ^ 0? for some rational 
function / of n (size of the input) argument^. Each leaf of the tree contains a set of function 
gi {i = 1,... ,n) on the n-valued input. Eor any input A = (ai,..., Un), the algorithm moves 
from the root down the tree. At each node the corresponding test is performed and a branch 
is followed according to whether the test outcome is > 0, < 0, = 0. When a leaf is reached, 
the output P = {pi,... ,pn) is given by pi = gfiA). The cost of the algorithm is defined to be 
the height of the tree. The complexity K{n) in this model is the minimum cost of any such 
algorithm. We will be concerned with an information theoretic lower bound on K{n). 

3.1 An approach based on unique configurations 

Eor any input A for the MCSP we define V{A) = {P | P is an output conhguration for A}. 

We say that a configuration P is unique if and only if there exists an input A for the MCSP 
for which V{A) = {P}. 

Our approach to prove a lower bound on MCSP consists on finding a large set of distinct 
unique configurations. 

In fact, let P = {Pi, P 2 ,..., Pfc} be a set of distinct unique configurations. Moreover, let 
Ai, for z = 1,..., A:, be an instance for MCSP such that Pi is its unique configuration, i.e., 
V{Ai) = {Pi}. Then, the instances in the set A = {Ai,..., Ak] are distinct. Hence, in any 
decision tree, for any distinct inputs Ai,Aj, the leaves associated to the corresponding outputs 
must be distinct, hence the decision tree must have at least k leaves. Then, the height of 
the tree is at least [log 3 k '\, which shows that [log 3 k'] is a lower bound to the problem in the 
decision tree model. We have proved the following. 

Theorem 2. If P is a subset of distinct unique configurations, then K{n) = H(log |P|). 

We shall observe that if every configuration were unique, then we could prove a lower bound 
of H(nlogn) since there exists n! configurations of size n. However, there are configurations 
that are not unique like the configuration P = (1,2,1). In fact, assume that A = ( 01 , 02 , 03 ) 
is an input for which P is its unique configuration. Then, we would have both oi > 03 and 
02 + 03 > oi + 02 , which is not possible. 

The following construction shows that the number of unique configurations is indeed very 
large. 

all functions are restricted to be linear (resp. polynomials) the model is referred to as the linear (resp. 
algebraic) decision tree model. 
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Theorem 3. There exist 0(2"') unique eonfigurations. 
Proof. Fix a subset 5 C {4, 5,..., n} and define 


'0 iii = l 

2 if i = 2 

< 4n if z = 3 

3 if i > 4 and i ^ S 

_ 1 if i > 4 and i € S 



^n-j + 1 


if j < n — 2 and {j + 2) ^ S 
if j < n — 2 and {j + 2) € S 
if j > n — 1 


We first show that for the input ^4'^ = the unique output conhguration is 

= {pf,... ,p^). For this we need to verify that for each j = 1,... ,n, in ^ there is only one 
maximum consecutive subsum of length j and it starts at pj as given above. 

The statement is trivially true for j = n and for j = n — 1 since in the latter case it is 
enough to observe that 0 = ai < 02 . 

Since 03 = 4n > follows easily that for any l<A:<n —2a maximum consecutive 

sum of length k must contain the element 03 . Any such sum will then start at one of the first 3 
elements. However, since any element other than ai is greater than zero, it also follows that a 
maximum consecutive sum of size k G {l,...,n — 2} must start at 02 or 03 . In formula, for 1 < 
A; < n —2, let Sk denote the maximum consecutive sum of size k, then Sk G {^^=2 X]^= 3 ^ 

Let Afc be the difference between the two possible values for Sk. We have ~ 

a-i = 02 — afc +2 = 2 — ak+ 2 - Thus, A^ > 0 if /c G 5 and A*, < 0 otherwise, that is, 
Sk = Oj if ^ G S' and Sk = Oj if /c 0 S', which proves the above claim. The 

uniqueness of the configuration follows from the fact that all the inequalities in the above 
arguments are strict. 

Therefore, for each S C {4,..., n} we obtain an unique output configuration. Since there 
are 2"/8 distinct subsets of {4, ...,n} and each of them corresponds to a distinct unique 
configuration we have the desired result. 


□ 

The existence of at least exponentially many (in n) configurations supports our approach 
and is an indication of its potential. However, this result combined with Theorem [2] is still not 
enough to obtain a non trivial (superlinear) lower bound for the MCSP. This motivated us to 
enrich our analysis by empirically exploring the number of unique configurations. 
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4 Empirical evidences that MCSP requires Q(nlogn) time 

In order to count the number of unique configurations it is important to decide whether a 
given configuration P is unique or not. For that we test whether there exists an input sequence 
A = (oi,..., Qn) for which P is its unique output configuration. For i G {1,..., n — 1} and a 
configuration P = (pi,... ,Pn), let Q{P,i) be the following set of inequalities: 

Ofc > X] Ofc for j = 1 ,..., n - i + 1 and j / pi 

k=pi k=j 

It is easy to realize that the contiguous subsequence of length i that starts at position pi of 
A has sum larger than the sum of any other contiguous subsequence of length i if and only 
if the point A = (oi,... , 0 ^) € i?” satishes the above set of inequalities. Thus, P is a unique 
configuration if and only if the set of inequalities Q{P, 1) U Q{P, 2) U • • • U Q{P, n — 1) has a 
feasible solution. In our experiments we employed a linear programming solver to perform this 
verification. 

In order to speed up our computation we also employ a sufficient condition for the non¬ 
uniqueness of a configuration, which is provided by the following proposition. 

Proposition 1 (Non-Adjacency Property). Let P = {pi,... ,pn) be an output eonfiguration. 

If there exist i, j G {1,... , n} such that pj = pi + i, then P is not unique. 

Proof. Let i,j be such that pj = pi + i. In addition, assume that there is an input A for which 
P is its unique conhguration. This means that for any r ^ i and u ^ j we have 

r-l-i—1 pi-l-i—1 u+j —1 Pj+j — ^ 

^ ^ Os < ^ ^ Og ^ ^ Os < ^ ^ Os (I) 

s=r s=pi s=u 

Let m = min{z, j} and let’s split the sequence Op. Op.+i... ap.+j_i Op. Opj+i ... Op^.+j_i into 
three parts, and let Ri,R 2 ,R 3 be the intervals of indices of the elements in these three parts 
defined as follows: 

Ri = ■. .Pi+m-l},R 2 = {pi+m,pi+m+l,... pj+j-l-m}, R 3 = {pj+j-m,. ..pj+j-l}. 

Case 1. If i < 3 we have EsL^r^ + Esei ?2 TI’s=p]~^ = EseR 2 + 

Yls&Ri Then applying the second inequality in ([T|) we get 

Pi+j-i Pj+i-^ 

^ ^ Os -|- ^ ^ Os — ^ ^ Os < ^ ^ Os — ^ ^ Os “1“ ^ ^ Os, 

SG-R 2 ^~Pi ^~Pj S^R 2 

which implies that = Z^seRi ^ ^ Yfs^=Jj+j-i which is a contradic¬ 

tion to the first inequality in ([T]). 

Case 2. If i > j we have + EsGi ?2 + 

EsgRs Then applying the first inequality in ([1]) we get 

Pi+i-i Pj+j-^ 

'y ^ Os “1“ ^ ^ Os — 'y ^ ^ Os — ^ ^ Os -|- ^ ^ Os, 

sGi?i .SGR 2 «=Pi s=Pi+j seJ?2 seJJs 
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which implies that Y]s=l ^ = EseiJi > J^sgRs ^ ttg which is a contradiction 

to the second inequality in (fTj). 

Since in either case we have a contradiction, it follows that if i,j are adjacent maxima than 
P cannot be a unique configuration. □ 

For instance, consider the configuration P = (5, 1,3,4,1). By taking i = 2 and j = 3, we 
have Pi + i = Pj- Thus, we can conclude that P is not unique, as can be easily verified. 

Unfortunately, this non-adjacency property does not completely characterize the set of 
unique configurations because there exist configurations with no adjacent maximums that are 
not unique. For example, the configuration P = (2,4, 2,1, 2,1) has no adjacent maximums 
and is not unique. In fact, if ^ = (oi,... , 05 ) is an input sequence for which P is its unique 
configuration then we must have simultaneously: (i) 02 > 04 because of pi = 2; (ii) 04 -|- 05 > 
ai -|- 02 because of p 2 = 4; and (hi) ai -|- 02 + 03 -|- 04 > 02 + 03 -|- 04 -|- 05 because of p 4 = 1. 
However, this is impossible, since by summing the first two inequality and adding 03 on both 
sides, we obtain a contradiction to the third inequality. Nonetheless, we can use the above 
condition to speed up our algorithms. 

In order to exactly count the number of unique configurations, we explore the tree of all 
permutations of {1,... ,n}, pruning the nodes that do not lead to an unique configuration. 
In fact, assume that the values of the positions 1,2,... — 1 of the permutation P, under 

construction, are already fixed. Then, for each j that does not appear in the first i — 1 
positions, the procedure IsFeasible(P, i, j), explained below, is called to verify whether it is 
possible to extend the partial permutation P by setting the value of position i to j. If this is 
the case, the algorithm set pi = j and it proceeds constructing the permutations. Whenever 
we complete a permutation we increase the number of unique configurations. 

The procedure Is-Feasible(P, i, j) first verifies if the subsequence of A starting at posi¬ 
tion j is adjacent to some maximum subsequence that has already been fixed. If this test is 
positive it rules out j as a value for pi due to Proposition [H Otherwise, the procedure verifies 
whether the set of inequalities Q{P, 1) U • • ■UQ{P, i) is feasible and it returns TRUE or FALSE, 
accordingly. 

Algorithm 1 IsEeasible(P, i, j) 

1 : if the subsequence of length i starting at position j is adajcent to the subsequence of length k 

starting at pk for some k < i then 
2: return FALSE 

3: else 

4: Pi = j % this is only to verify if this extension if feasible. 

5: if the set of inequalities Q(P, 1) U • • • U Q(P, i)is feasible 

6: then return TRUE 

7: else return FALSE 


Table [U presents the results obtained by the deterministic approach. We were able to 
determine the number of unique configurations up to n = 14. The results suggest a super 
exponential growth. Indeed, notice the growth of the ratio between the number of unique 
configurations and (n/2)!. We selected the function (n/2)! because it is a simple function 
whose logarithm is 0 (nlogn). 

All the executions required 27 hours of CPU time under the following hardware and soft¬ 
ware specifications: Main Hardware Platform: Intel® Core^^ i7 3960X, 3.30GHz CPU, 32GB 
RAM, 64-bit; OS: Windows 7 Professional x64; Compiler: Microsoft®Visual C# 2010 Com¬ 
piler version 4.0.30319.1; Solver: Gurobi Optimizer. 
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Table 1: The number of unique configurations for n = 1,..., 14 compared to the value n/2!. 


n 

U(n) = N2 Unique Config. 

2 • 

Latin 

1 

1 

0.8 

1.25x 

2 

2 

1.0 

2 .00x 

3 

4 

1.3 

3.07x 

4 

12 

2.0 

6 .00x 

5 

36 

3.3 

10.90X 

6 

148 

6.0 

24.66X 

7 

586 

11.6 

50.51X 

8 

2,790 

24.0 

116.25x 

9 

13,338 

52.3 

255.02X 

10 

71,562 

120.0 

596.35X 

11 

378,024 

287.9 

l,313.03x 

12 

2,222,536 

720.0 

3,086.85x 

13 

12,770,406 

1,871.3 

6,824.34x 

14 

78,968,306 

5,040.0 

15,668.31x 


In order to extend our analysis to larger instances we then employed a probabilistic ap¬ 
proach. 

4.1 A Probabilistic Approach 

The first idea for estimating the number of unique configurations is to sample a large number 
M of configurations and test whether each of them is unique or not. The number of unique 
configurations found over M is an unbiased estimator for the number of unique configurations. 
With this approach we managed to obtain strong evidence of the super linear lower bound for 
n up to 28. To extend our range we followed a different approach. 

In the deterministic case, we explore the configuration space via a depth first search over 
the back-tracking tree of all possible configurations. In our probabilistic approach, presented 
in Algorithm O we randomly traverse a path in this tree that corresponds to a unique config¬ 
uration. Assume that we have already fixed the values for the positions 1 , 2 ,... , i — 1 of the 
configuration P that is under construction. Then, in order to set the value of pi, we construct 
a list S of all values j £ {1, ... ,n — i + 1} such that IsFeasible(P, i,j) returns TRUE. Let 
bi = IS"! be the branching factor of our path at level i. Then, we randomly choose one of the 
values in S for pi and continue to set the values of pj for j > i; if the method observes the 

n 

branching factors bi,b 2 , ■ ■ ■ ,bn, iri a root to leaf path, then it outputs A = as a guess for 

i=l 

the number of unique configurations. 

The value X can be used to estimate the number of unique configurations because X is 
a sample of a random variable X whose expected value E[X] is equal to the number of the 
unique configurations. In fact, let £ be a leaf located at depth n of the backtracking tree, that 
is, i corresponds to a unique configuration. The probability of reaching i in our random walk 
is where B{i) is the product of the branching factors in the path from the root of the 

tree to In addition, if £ is reached, the method outputs B{£). Let L be the set of leaves 
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Algorithm 2 BRANCHiNG-PRODUCT(n) 

1: X ^ 1 

2 : for z 1 to n do 

3: S' ^ 0 

4: for jX— 1 to n — i + 1 do 

5: if IsFEASiBLE(P,i, j) then 

6 : Add j to the list S of possible branchings 

7: if S is empty then 

8: return 0 

9: else 

10: X^Xx|S| 

11 : Pi ■(— value randomly selected from list S 

12: return X 


located at level n in the backtracking tree. Thus we have that 

£gL '' ^ 


After coming up with this approach, we found out that it had already been proposed to 
study heuristics for backtracking algorithms [HI 16]. 

We do not use directly the observed value X to estimate the number of unique configura¬ 
tions; instead, we assume as a null hypothesis that E[X], the number of unique configurations, 
is smaller than or equal to (|^!) and use the sampled value of X to reject this hypothesis with 
some level of confidence. 

Let c > 1. Under the hypothesis that E[X] < (§)!, using Markov’s inequality, it follows 
that: 


Pr 


X < 


X > C-! 

. “ 2 J 

1 

> 1 - 

c 


< Pr[X > cE[X]] < 


which implies that Pr 

Therefore, if we sample X and find a value larger than c^!, we can reject the hypothesis 
and conclude that the number of unique configurations is larger than S! with confidence of 

1 -i. 


C 


We can extend this approach by taking the maximum of k samples. Let Xi,... be 
the values for k samples of the random variable X. Then, with the hypothesis E[X] < and 
using the above inequality we have 


Pr 


n, 


max{Xi,... ,Xfc} < c-!j 


= Pr 


A 


(Xi<c 



IlPr 

2=1 


w < C-! 

2 J 



( 2 ) 

Thus, if one of the values Xi, X 2 ,..., X^ is grater than or equal to c§!, then we can reject 
the hypothesis and conclude that E[X] > with confidence (1 — 1/c)^ . 

In our experiments we sampled 1000 values of X for different values of n. The choice of 
1000 for k was to guarantee a reasonable CPU time. Let max{n, 1000) be the maximum value 
found in the 1,000 samples and let c„ = [max{n, 1000)/^!J. Table [5] shows the probability of 
Pr[max{Xi,..., Xiooo} < c„^!] assuming that E[X] < for configurations up to size n = 100. 
This probability also expresses our confidence to reject the hypothesis because in fact we’ve 
found a value equal to c„^!. Based on these results we state the following conjecture. 
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Table 2; Pr max{Xi,..., < Cn§! for/c = 1,000 


n 


Pr[max^{Xi} < c„f!] 

10 

6,048 

99.98346560846560% 

11 

23,760 

99.99579124579120% 

12 

38,880 

99.99742798353910% 

13 

439,296 

99.99977236305360% 

14 

558,835 

99.99982105636870% 

20 

372,252,672 

99.99999973136530% 

30 

102,827,922,078 

99.99999999902750% 

40 

4,680,410,711,674 

99.99999999997860% 

50 

69,590,788,615,385 

99.99999999999860% 

60 

562,841,769,233,371 

99.99999999999980% 

70 

136,904,322,455,757 

99.99999999999930% 

80 

87,399,891,508,924 

99.99999999999890% 

90 

73,279,283,017 

99.99999999863540% 

100 

204,252,401 

99.99999951040970% 


Conjecture 1. The running time of MCSP is n(nlogn) in the decision tree model. 
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